Local Geometry of One-Hidden-Layer Neural Networks for Logistic Regression

نویسندگان

  • Haoyu Fu
  • Yuejie Chi
  • Yingbin Liang
چکیده

We study the local geometry of a one-hidden-layer fully-connected neural network where the training samples are generated from a multi-neuron logistic regression model. We prove that under Gaussian input, the empirical risk function employing quadratic loss exhibits strong convexity and smoothness uniformly in a local neighborhood of the ground truth, for a class of smooth activation functions satisfying certain properties, including sigmoid and tanh, as soon as the sample complexity is sufficiently large. This implies that if initialized in this neighborhood, gradient descent converges linearly to a critical point that is provably close to the ground truth without requiring a fresh set of samples at each iteration. This significantly improves upon prior results on learning shallow neural networks with multiple neurons. To the best of our knowledge, this is the first global convergence guarantee for one-hidden-layer neural networks using gradient descent over the empirical risk function without resampling at the near-optimal sampling and computational complexity.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Credit Risk Measurement of Trusted Customers Using Logistic Regression and Neural Networks

The issue of credit risk and deferred bank claims is one of the sensitive issues of banking industry, which can be considered as the main cause of bank failures. In recent years, the economic slowdown accompanied by inflation in Iran has led to an increase in deferred bank claims that could put the country's banking system in serious trouble. Accordingly, the current paper presents a prediction...

متن کامل

Prediction of the deformation modulus of rock masses using Artificial Neural Networks and Regression methods

Static deformation modulus is recognized as one of the most important parameters governing the behavior of rock masses. Predictive models for the mechanical properties of rock masses have been used in rock engineering because direct measurement of the properties is difficult due to time and cost constraints. In this method the deformation modulus is estimated indirectly from classification syst...

متن کامل

Prediction of breeding values for the milk production trait in Iranian Holstein cows applying artificial neural networks

The artificial neural networks, the learning algorithms and mathematical models mimicking the information processing ability of human brain can be used non-linear and complex data. The aim of this study was to predict the breeding values for milk production trait in Iranian Holstein cows applying artificial neural networks. Data on 35167 Iranian Holstein cows recorded between 1998 to 2009 were ...

متن کامل

Evaluation of effects of operating parameters on combustible material recovery in coking coal flotation process using artificial neural networks

In this research work, the effects of flotation parameters on coking coal flotation combustible material recovery (CMR) were studied by the artificial neural networks (ANNs) method. The input parameters of the network were the pulp solid weight content, pH, collector dosage, frother dosage, conditioning time, flotation retention time, feed ash content, and rotor rotation speed. In order to sele...

متن کامل

Study of Pedotransfer Functions Multivariate regression, MLP and RBF to estimate CEC for Soils of North Ahvaz

To estimate the Cation Exchange Capacity (CEC), indirect manner used of Pedotransfer Functions (PTFs). CEC is one of the important soil fertility factors, and not measured directly because it is costly and time consuming. Thus, used from regression equations between easily and non-easily soil properties. The purpose of this research, is develop the PTFs for CEC, with use of easily available soi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1802.06463  شماره 

صفحات  -

تاریخ انتشار 2018